Directly estimating non-classicality 
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We establish a method of directly measuring and estimating non-classicality — operationally defined in terms 
of the distinguishability of a given state from one with a positive Wigner function. It allows to certify non- 
classicality, based on possibly much fewer measurement settings than necessary for obtaining complete tomo- 
graphic knowledge, and is at the same time equipped with a full certificate. We find that even from measuring 
two conjugate variables alone, one may infer the non-classicality of quantum mechanical modes. This method 
also provides a practical tool to eventually certify such features in mechanical degrees of freedom in opto- 
mechanics. The proof of the result is based on Bochner's theorem characterizing classical and quantum charac- 
teristic functions and on semi-definite programming. In this joint theoretical-experimental work we present data 
from experimental optical Fock state preparation, demonstrating the functioning of the approach. 



Where is the "boundary" between classical and quantum 
physics? Unsurprisingly, acknowledging that quantum me- 
chanics is the fundamental theory from which classical prop- 
erties should emerge in one way or the other, instances of this 
question have a long tradition in physics. Possibly the most 
conservative and stringent criterion for non-classicality of a 
quantum state of bosonic modes is that the Wigner function — 
the closest analogue to a classical probability distribution in 
phase space — is negative, and can hence no longer be inter- 
preted as a classical probability distribution Cmi. From this, 
negativity of other quasi-probability distributions, familiar in 
quantum optics, such as the P-function 1 1 , 4| follows. In fact, 
a lot of experimental progress was made in recent years on 
preparing quantum states of light modes that exhibit such non- 
classical features, when preparing number states, photon sub- 
tracted states, or small Schrodinger cat states (SHU. At the 
same time, a lot of effort is being made of driving mesoscopic 
mechanical degrees of freedom into quantum states eventually 
showing such non-classical features |10|. All this poses the 
question, needless to say, of how to best and most accurately 
certify and measure such features. 

In this work, (I) we demonstrate that, quite remarkably, 
non-classicality in the above sense can be detected from mere 
measurements of two conjugate variables. For a single mode, 
this amounts to position and momentum detection, as can be 
routinely done by homodyne measurements in optical sys- 
tems. (II) What is more, using such data (or also data that are 
tomographically complete) one can get a direct and rigorous 
lower bound to the probability of operationally distinguishing 
this quantum state from one with a positive Wigner function — 
including a full certificate. Such a bound uses information 
from possibly much fewer measurement settings than needed 
for full quantum state tomography. At the same time, quantum 
state tomography using Radon transforms for quantum modes 
is overburdened with problems of ill-conditioning: This gives, 
strictly speaking, rise to the situation that when fully recon- 
structing a state based on such tomographically complete data, 
one often should expect to encounter such large error bars that 
the resulting state would well be also consistent with having 
had no non-classicality at all. 

The method introduced here, in contrast, is a direct method 
giving rise to a certified bound which arises from conditions 



all classical and quantum characteristic functions have to sat- 
isfy as being grasped by the classical and quantum Bochner's 
theorem |1 1 1. Hence, we ask: ''What is the least non-classical 
state consistent with the data^'l Intuitively speaking, the proof 
circles around the deviation of a quantum characteristic func- 
tion as the Fourier transform of the Wigner function from a 
classical characteristic function. This deviation can then be 
formulated in terms of a semi-definite program — so a well- 
behaved convex optimization problem — giving rise to certi- 
fiable bounds. The same technique can also be applied to 
notions of entanglement, and indeed, the rigor applied here 
reminds of applying quantitative entanglement witnesses. In 
this sense, one can directly certify quantum properties with 
much less than full tomographic knowledge. What is more, 
the criterion evaluation procedure is efficient. At present, such 
techniques should be most applicable to systems in quantum 
optics, and we indeed implement this idea in a quantum opti- 
cal experiment preparing a field mode in a non-classical state. 
Yet, they should be expected to be helpful when eventually 
certifying that a mesoscopic mechanical system has eventu- 
ally reached quantum properties ifTOl , where "having achieved 
a non-classical state", with careful error analysis, will consti- 
tute an important benchmark. 

Measure of non-classicality. - Non-classicality is most rea- 
sonably quantified in terms of the possibility of operationally 
distinguishing a given state from a state that one would con- 
ceive as being classical. That is to say, the meaningful notion 
of distinguishing a state from a classical one is as follows. 

Definition 1 (Measure of non-classicafity) Non-classicality 
is measured in terms of the operational distinguishability of a 
given state from a state having a positive Wigner function, 



V{p) = min||p- 

a;GC 



■will 



(1) 



where C denotes the set of all quantum states with positive 
Wigner function and \\ • ||i is the trace norm. 

This measure is indeed the operational definition of a non- 
classical state — as long as one accepts the negativity of the 
Wigner function as the figure of merit of non-classicality. 
Needless to say, the operational distinguishability with respect 



to other properties would also be quantified by trace-distances, 
and naturally several quantities of such a type can be found in 
the literature (see, e.g., Ref. 1 12|). By definition, it has among 
others the following natural properties: 

(a) It is invariant under passive and active linear transfor- 
mations. 

(b) It is non-increasing under Gaussian channels, and in fact 
under any operation that cannot map a state with a positive 
Wigner function onto a negative one. 

The latter property is an immediate consequence of the 
trace norm being contractive under completely positive maps. 
Moreover since Gaussian states are positive this measure 
of negativity gives also a direct lower bound to the non- 
Gaussianity of the same state. 
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FIG. 1. (a) Position and momentum distributions for an exact cat 
state, (b) Wigner function based on the output of the SDR 

Characteristic functions and Bochner's theorems. - We 
consider physical systems of n bosonic modes, associated 
with canonical coordinates R = (g'l, . . . , g'n,^!, • • • ,Pn), of 
"position" and "momentum", or some quadratures. In the cen- 
ter of the analysis will be quantum characteristic functions 
I2I [T3l . For n modes, the quantum characteristic function 
X : H^^ ^ C is defined as 
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so as the expectation value of the Weyl or displacement oper- 
ator, where the matrix 
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reflects the canonical commutation relations. This characteris- 
tic function is nothing but the Fourier transform of the familiar 
Wigner function W : R^^ -^ R, given by 
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A key tool in the argument will be the notion of X-positivity 
of a phase space function 1,11,1 . 

Definition 2 (A-positivity) A function x ' ^^^ -^ C is said 
to be X-positive definite for A G R if for every tti G N and 



for every set of real vectors T = {£,1^ £,2^ - - - ^ £,m) the m x m 
matrix M^^^ (x, T) with entries 

(mW(x,T))^^ =x(a-6)e'^«'''^«'/', (5) 

is non-negative, so M^^\x'> T) > 0. 

Eq. ([2]) defines the characteristic function of a given quan- 
tum state. In turn, one can ask for a classification of all func- 
tions that can be characteristic functions of a quantum state, 
or some probability distribution in the classical case. Such 
a characterization is captured in the quantum and classical 
Bochner's theorems [llj. The following assertions follow: 

(i) Every characteristic function of a quantum state must be 
1 -pos itive defin ite . 

(ii) Every characteristic function of a quantum state with a 
positive Wigner function must be at the same time 1 -positive 
definite and -pos itive definite. 

Measuring non-classicality. - Data are naturally taken as 
slices in phase space, resulting from measurements of some 
linear combinations of the canonical coordinates, as they 
would be obtained from a phase sensitive measurement such 
as homodyning in quantum optics. One collects data from 
measuring observables Uk • R for some collection of Uk G R^^ 
with \\uk\\ = 1. For example, in the simplest case of one mode 
one could measure only q and p or, if the state is phase invari- 
ant, one could average over all the possible directions. With 
repeated measurements one can estimate the associated prob- 
ability distributions P/c : R ^ R+, related to slices of the 
characteristic functions by a simple Fourier transform 
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Actually, in a real experiment one can build only a statisti- 
cal histogram rather than a continuous probability distribu- 
tion. As a consequence, measurements of values of the char- 
acteristic function must be equipped with honest error bars. 
An estimate of this error is given by (see Appendix) 



(7) 



S{uj) = \uj\h^n/VN, 



where 2h is the width of each bin of the histogram, N is the 
number of measurements and n is the number of standard de- 
viations that one should consider depending on the desired 
level of confidence (lAJ . This kind of measurements can be 
performed also in opto -mechanical systems where a particular 
quadrature of a mechanical oscillator can be measured a pos- 
teriori by appropriately homodyning a light mode coupled to 
the mechanical resonator 1 15|. A different idea has recently 
been proposed for directly pointwise measuring the character- 
istic function of a mechanical mode coupled to a two-level 
system [1I61I . In both cases the method that we are going 
to describe can be easily applied. Restricted measurements 
also arise in the context of bright beams ifTvl . where Mach- 
Zehnder interferometers have to replace homodyning in the 
absence of the possibility of having a strong local oscillator. 
In the study of states of macroscopic atomic ensembles iTSl 
similar issues arise. 



Bounds to the non- das sic ality from convex optimization. - 
We assume that we estimate the values of the characteristic 
function x(Cj) — Cj for a given set of points fj, j = 1, . . . ,p, 
within a given error (^j > 1 14|, so that for all j 
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Now pick a set of suitable test vectors T = (^i, . . . , ^m), the 
differences S,j — ^k of which at least contain the data points 
^i,. . . ,^p. Based on this, we define the following convex op- 
timization problem as a minimization over x, x, 

minimize x, (9) 

such that |Re(x(C)) " Re(cj)| < 6j, j = 1,. . . ,p, (10) 

|Im(xfe)) - Im(c,)| < S,, j = 1, . . . ,p,(ll) 

M(°)(x,T)+a;ml >0, (12) 

M(i)(x,T)>0, (13) 

where M^^^ (x, T) and M^^^ (x, T) are the Hermitian matrices 
^ associated with the A-positivity, based on the test points 
^1 7 • • • 7 ^m as being specified in Def . [2] The minimization 
is in principle performed over all functions x • ^^^^ ~^ ^ 
such that x(~0 — x(0*' where x(6) is constrained by the 
data and M*^^/^^(x,T) depend on the test points. Since we 
take only a finite number of points of x, yet, the above prob- 
lem gives rise to a semi-definite problem (SDP) 1 19|. This 
can be efficiently solved with standard numerical algorithms. 
What is more, by means of the notion of Lagrange duality, 
one readily gives analytical certifiable bounds to the optimal 
objective value: Every solution for the dual problem will give 
a proven lower bound to the primal problem 1 19], and hence 
a lower bound to the measure of non-classicality itself. The 
entire procedure hence amounts to an arbitrarily tight convex 
relaxation of the Bochner constraints. We can now formu- 
late the main result: Eq. (|9| gives rise to a lower bound for 
the non-classicality: Given the data (and errors), one can find 
good and robust bounds to the smallest non-classicality that 
is consistent with the data. 

Theorem 3 (Estimating non-classicality) The output x' of 
Eq. (M) is a lower bound for the non-classicality, r]{p) > x' . 

The proof proceeds by constructing a witness operator 

where ^i, . . . , ^m ^ ^^^ are the test vectors from Bochner's 
theorem used in the SDP and v is the normalized eigenvec- 
tor associated with the minimum eigenvalue of M^^\x' •, T), 
where x' is the optimal solution for x- For a given state p, this 
operator F has the following properties: 

1. F = Ft, 

2. |tr(Fcj)| < 1 for all quantum states uj. 

3. tr(Fco') > for all quantum states u e C. 

4. If x^ > is the optimal solution, then tr(Fp) < — x^ 
This can be seen as follows: 1. follows directly from con- 
struction, noting that D{^) = D{-^y for ^ G R^"^. Property 



2. follows from \tT{D{^)u)\ = |x(OI ^ 1' valid for every 
characteristic function, from which we find 



^ m 

HFco)\<-J2 klhl 



k,l = l 




<1, 



where we have used that for every normalized vector 
XlHi \^i\ ^ m^/^, as can easily be seen by solving a 
quadratic problem. To show 3., we can exploit the property 
of the characteristic function of cj of being 0-positive definite. 
This implies that 
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where ^^^^^^(x, T) is the Bochner matrix associated with the 
characteristic function x of cj. Finally, 4. originates from the 
structure of the SDP as a convex optimization problem. For 
optimal solutions x' and x^ the constraint ( 12 ) implies that the 
minimum eigenvalue of M*^^^ (x^ T) is equal to —x'm. Then 

^AFp) < J:Z=i vlviMi°^{x',T)/m = -x' . 

The four properties suggest that F is actually a witness ob- 
servable able to distinguish a subset of non-classical states 
from the convex set of classical states. Formally, from the 
variational definition of the trace norm, we have 



Vip) =min||p- 



■ uj\\i > mintr(cjF) - tr(Fp) > x\ (16) 

oj^C 



which is the lower bound to be shown. 

An example: Schrodinger cat state. - As an example we 
consider a quantum superposition of two coherent states, so 
lip) ^ {\a) -\- \ — a)) with a = 1.77. We assume to measure 
only the probability distributions of position and momentum 
(Fig.[l]a): Pi{q) = \ {q\^) \^ ^nd P2{p) = |(p|V^)p, that is 
the data is collected from a mere/7a/r of canonical operators. 
This amount of information is of course not sufficient for to- 
mographically reconstructing the state since it corresponds to 
just two orthogonal slices of the characteristic function. 

In order to define the SDP we consider a 25 x 25 square 
lattice centered at the origin of the domain of the characteris- 
tic function and we optimize over the values of x at the lattice 
points. Position and momentum measurements can be used 
to define constraints ([TOjTT]) for only two slices of the lattice 



and we assume an error of 5j = 10 ~ for each point. We 
generate 100 random test vectors and we construct the associ- 
ated A— positivity constraints ( T2p3 ). The output of the SDP 
is x' 0:^ 0.05 > which is a certified lower bound for the 
non-classicality of the state. 

Experimentally detecting non-classicality. - Finally, to cer- 
tify the functioning of the idea in a quantum optical con- 
text, we apply our method to experimental data. We con- 
sider data from a heralded single-photon source based on 
parametric down-conversion (cf. Ref. |9 1). Here, one of the 
two down-converted photons is detected and heralds the pres- 
ence of the other photon. The heralded single photon is sam- 
pled 180, 000 times by homodyne measurements at phase- 
randomized quadratures (Fig.[2]a). Instead of using these data 
in tomography we will use the same data to directly extract a 
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FIG. 2. (a) Raw measured quadrature distribution from the experi- 
ment without any correction, (b) Related Wigner function based on 
the output of the SDR 



certified lower bound to the non-classicality of the state. We 
will show that the most classical state consistent with those 
data has a negative Wigner function (Fig. |2]a) which is close 
to the one reconstructed with a maximum likelihood method 
in Ref. (91 . This strengthens the finding of a negative Wigner 
function, as we hence directly certify it as a worst case bound 
including error bars. Data correspond to phase randomized 
measurements meaning that we can use the same probability 
distribution for every phase space direction, the phase being 
not available in this experiment. Since our non-classicality 
measure is convex, averaging over phase space directions is 
an operation which can only decrease the negativity of the 
state. This means that a lower bound to the non-classicality of 
the phase randomized state will be valid for the original state. 
In order to apply our algorithm we use the measured data 
to constrain all the points of the characteristic function on a 
37 X 37 lattice. Error bars are estimated using Eq. ^ with 
n = b standard deviations. This means that the probability 



that all the points of the lattice lie inside the error bars is larger 
than 99.9%. The lower bound for the non-classicality coming 
out from the SDP (200 random test vectors have been used) is 
larger than zero x' c^ 0.0028, meaning that the Wigner func- 
tion of the state cannot be a positive probability distribution. 
Indeed, the Wigner function reconstructed from the optimal 
solution of the SDP (Fig. [2]b) is clearly negative even if we 
asked for the most positive one consistent with measured data. 

Extensions of this approach. - Needless to say, this ap- 
proach can be extended in several ways. Indeed, the method 
can readily be generalized to produce lower bounds for en- 
tanglement measures 1201 in the multi-mode setting. Also, 
this idea can be applied to the situation when not slices are 
measured, but points in phase space, such as when using a 
detector-atom that is simultaneously coupled to a cantilever 
fW\ . It would also constitute an interesting perspective to see 
how the present ideas can be used to certify deviations from 
stabilizer states for spin systems, being those states with a 
positive discrete Wigner function [21 1. 

Summary. - We have introduced a method to directly mea- 
sure the non-classicality of quantum mechanical modes, re- 
quiring less information than tomographic knowledge, but 
at the same time in a certified fashion. In this way, these 
ideas are further advocating the paradigm of "learning much 
from little" — getting much certified information from few 
measurements — complementing methods of witnessing en- 
tanglement 1 20, 22], recent ideas of compressed sensing 1231 
or matrix-product based 1241 approaches to quantum state to- 
mography, detector tomography |[25]| , or the direct estimation 
of Markovianity f26l of a continuous process. It is the hope 
that this paper further stimulates work in the context of this 
paradigm. 
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I. SUPPLEMENTARY INFORMATION (EPAPS) 

In this appendix, we add a discussion on measurement er- 
rors not essential for the main text. We assume data to be 
organized in histograms made of bins of width 2h centered at 
points Sj = 2hj. The normalized height of the j-th bin is 
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where {wi} are random variables with zero average asso- 
ciated to the statistical error. In a standard homodyne de- 
tection, {wj} have Poissonian variances equal to (wjWi) = 

dj^i J^^_f^ Pk{s)ds/N, where N is the total number of mea- 
surements. The simplest approximation for the value of the 
characteristic function defined in Eq. ([6]) is 



X^{oocJUk) = ^Pie''^'^ 






The error will be the sum of two terms 



Xiujcruk) -X^'i^f^Uk) 
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The first term is the error due to the finite resolution of the 
histogram, which is bounded by |A| < h\uj\, while w = 
^- Wjc'^^^^ is the random statistical error. This error could 
be estimated and possibly reduced by using bootstrapping or 
maximum likelihood algorithms. For our purposes, we sim- 
ply calculate the standard deviation of the random variable w, 
which is given by ((|^P))^^^ = 1/VN. Now, for the central 
limit theorem w can be approximated by a Gaussian distribu- 
tion, meaning that the probability that the following tube con- 
straint is violated is exponentially suppressed in the number 
of considered standard deviations n (T4\ . 



Ixi^auk) - X^{^(^Uk)\ <h\uj\^ n/VN. (20) 



